
SPECIFICATION AMENDMENTS 

Please replace the below identified paragraphs with the following amended and newly 
added paragraphs. Note that as a result of extensive changes to the specification, and in 
accordance with 37 C.F.R § 1.125(b), a substitute specification incorporating the changes 
identified below is provided along with this amendment. No new matter is included in either 
the amendments or in the substitute specification. 

1. Please replace the paragraph on page 2, line 22 through page 3, line 2 with the 
following amended paragraph: 

To overcome the limitations in the prior art described above, and to 
overcome other limitations that will become apparent upon reading and 
understanding the present specification, the present invention is embodied in a 
system and method for statistically comparing a first set of digital data to at least a 
second set of digital data and matching the first set of digital data to appropriately 
corresponding portions of the second set of digital data. ¥ke =ln one embodiment 
the first or the second set of digital data can bo ^transformed during statistical 
analysis to enhance statistical analysis of the digital data. 

2. Please replace the paragraph on page 3, line 3-1 7 with the following amended 
paragraph: 

In one embodiment of the present invention, the system includes a host 
processor executing software that implements an address generator, an acceptance 
tester and a statistical comparison processor. The host processor controls the 
entire process and initially renders or rasterizes the sets of data. The address 
generator generates addresses, which can reflect a transformation, for the first set 
of data and the second set of data to be compared. The addresses are used by 
filtering functions to generate per-pixel values, such as color values. The 
acceptance tester receives the per-pixel values for performing various conventional 
pixel tests such as, for example, an alpha test, depth buffer tests, scissor tests, 
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stencil tests, blending, dithering, logical operations, etc., and then determines which 
fee pixels feat are to be used to contribute to the statistical analysis based on the 
results of one or more of the acceptance tests . The statistical comparison 
processor statistically analyzes the pixels between the first data set and the second 
data set for comparison purposes. The host processor then examines the statistical 
comparisons computed by the statistical comparison processor and makes further 
processing decisions. The process repeats until a desired result is computed, such 
as a match or non-match between the data sets. 

3. Please insert the following three newly added paragraphs on page 3, after the 
paragraph ending on line 17 of page 3: 

As is well known to those skilled in the art, conventional graphics processors 
or graphics rendering devices, including computer graphics cards and the like, are 
capable of being programmed to perform any number of functions other than simply 
processing pixels for display on a computer display device. For example, 
conventional raster graphics implementations typically have a number of buffers 
with a depth of 16, 32, or more bits per pixel. In general, each pixel can be 
considered to be a data element upon which the graphics hardware operates. This 
allows a single graphics language instruction executed by the graphics processor to 
operate on multiple data elements. 

Since the bits associated with each pixel can be allocated to one to four 
components, a raster image can be interpreted as a scalar or vector valued function 
defined on a discrete rectangular domain in the x-y plane. For example, the 
luminance value of a pixel can represent the value of the function while the position 
of the pixel in the image represents the position in the x-y plane. 

Alternatively, an RGB or RGB A image can represent a three or four 
dimensional vector field defined over a subset of the plane. Consequently, highly 
parallelized calculations can be performed on entire functions or vector fields 
directly within the graphics hardware of a conventional computer graphics card or 
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the like. Such uses of conventional graphics hardware for performing these types of 
computations have been well known to those skilled in the art for a number of years. 

4. Please replace the paragraph on page 3, line 18, through page 4, line 2, with the 
following amended paragraph: 

Alternately Therefore, in an alternate embodiment, t he system can bo is 
implemented in a three-dimensional (3D) graphics rasterizer or rendering device, 
such as, for example, a conventional computer graphics card or processor which 
has been modified to include the aforementioned statistical comparison processor . 
In this embodiment, the system includes a frame buffer (a block of graphics memory 
that represents the display screen) and texture memory (a block of graphics 
memory that can contain portions of the display screen), in addition to the 
components discussed above (i.e.. the address generator, the acceptance tester, 
and the statistical comparison processor). In one embodiment, the The first set of 
digital data can bo is stored in the frame buffer while the second set of data can bo 
is stored in the texture memory. Also, statistical generation can bo |s performed by 
the statistical comparison processor included with the modified rasterizer, with or 
without actually rendering or writing a 3D digital scene comprised of the digital data 
to the frame buffer. In this embodiment, rasterization and rendering techniques and 
advanced statistical generation and comparison of the present invention can bo a re 
integrated to form a novel video graphics device or hardware video card for 
computer systems. 

5. Please insert the following two newly added paragraphs on page 4, after the 
paragraph ending on page 4, line 2: 

Thus, in one embodiment, the system and method described herein 
compares and matches a first set of digital data to a second set of digital data. 
Further, during a raster transformation of the first and second sets of digital data, 
multiple images of the digital data are placed in texture memory as multiple 
textures. Then, statistics are gathered concerning the textures, and the raster 
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transformed sets of digital data are compared and matched against corresponding 
portions of each other. In this context, the system is described as being 
implemented in a computer system including a host processor and a modified 
graphics processor including a texture memory for textures, and a graphics 
processing chip having an address generator, an acceptance tester and a statistics 
comparison device. 

The statistical comparisons and matching processing are provided as part of 
the rasterization pipeline of the graphics processor, and the data extracted from the 
processes are then recorded for normalized correlations or variations, and for 
subsequent forwarding to the host processor or alternate processing system for 
further processing or storage, as desired. Further, as the data is passed through 
the rasterization pipeline of the graphics processor, statistics between the textures 
are gathered and processed via the statistical comparison processor. 

6. Please replace the paragraph on page 8, lines 1-10 with the following amended 
paragraph: 

In general, the present invention is embodied in a system and method for 
statistically analyzing and comparing a first group of pixels of a defined portion of a 
digital scene, such as an object or template within the digital scene, to a second 
group of pixels, such as the entire digital scene or the image. The template is 
matched to appropriately corresponding portions of the image that represent the 
template. QsftS€ Hn one embodiment, during statistical comparison and matching of 
the template and the image, either the first or the second group of pixels can bo is 
raster transformed. For instance, either the template or the image can bo is 
incrementally rotated , scaled, or skewed to enhance the statistical analyses. 
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7. Please replace the paragraph on 8, lines 11-19 with the following amended 
paragraph: 

In particular, first, the system receives digital input, such as the images or the 
templates. ms ^Further, in one embodiment this data em then tee= pre-processed, 
using a variety of techniques such as filtering, equalization, and edge or feature 
finding. The system then computes statistics on either transformed images or 
transformed templates. N ext, the resulting computed statistics are analyzed and 
new requests based on the resulting statistics are generated with different 
transformations and/or different images until a desired result is computed, namely a 
match or non-match between the template and the image. 

8. Please replace the paragraph on page 8, line 20, through page 9, line 2 with the 
following amended paragraph: 

¥ke =ln one embodiment the present invention can bo is used as an object 
detection and tracking system for computer vision, motion analysis and pattern 
recognition applications, as a video acceleration system for computer graphics video 
hardware cards, as a video CODEC (coder/decoder), or any other suitable device that 
can uti li zo utilizes statistical comparison and matching between sets of digital data, 
such as images. In addition, in one embodiment, t he present invention can bo is 
implemented as computer software running on a computer system, as instruction sets 
operating within a microprocessor for a hardware device, such as a computer graphics 
video card, as computer firmware operating within a digital device, such as a digital 
camera, or any other suitable digital processing system. 

9. Please insert the following seven newly added paragraphs on page 9, after the 
paragraph ending on page 9, line 2: 

As is well known to those skilled in the art, conventional graphics processors 
or graphics rendering devices, including computer graphics cards and the like, are 
capable of being programmed to perform any number of functions other than simply 
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processing pixels for display on a computer display device. For example, 
conventional raster graphics implementations typically have a number of buffers 
with a depth of 16, 32, or more bits per pixel. In general, each pixel is considered to 
be a data element upon which the graphics hardware operates. This allows a single 
graphics language instruction executed by the graphics processor to operate on 
multiple data elements. 

Since the bits associated with each pixel can be allocated to one to four 
components, a raster image can be interpreted as a scalar or vector valued function 
defined on a discrete rectangular domain in the x-y plane. For example, the 
luminance value of a pixel can represent the value of the function while the position 
of the pixel in the image represents the position in the x-y plane. Alternatively, color 
(and alpha) values of an RGB (or RGBA) image can represent a three or four 
dimensional vector field defined over a subset of the plane. Consequently, highly 
parallelized calculations can be performed on entire functions or vector fields 
directly within the graphics hardware of a conventional computer graphics card or 
the like. Such uses of conventional graphics hardware for performing these types of 
computations have been well known to those skilled in the art for a number of years. 

A conventional graphics processor includes a number of functional 
components including, for example, a frame buffer, texture memory, an address 
generator, and an "acceptance tester". Note that the term acceptance tester is used 
here to encompass any of the conventional per-pixel tests which are typically 
employed by conventional graphics cards, in combination with a determination as to 
whether particular pixels will contribute to statistical computations. 

For example, among the per-pixel tests performed by the acceptance tester 
in one embodiment is a conventional "alpha test" for checking to see whether a pixel 
being drawn has a 0 alpha. With conventional graphics cards, a pixel having an 
alpha value of 0 is simply skipped rather than being drawn. However, in the context 
of the present invention, since there are two different pixels that are being 
compared, the comparison is skipped if either pixel has a 0 alpha value. Another 
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conventional per-pixel test performed by the acceptance tester in one embodiment 
is a "depth buffer" test for determining whether the current pixel being drawn is 
visible, based on its current z-buffer value. In the context of the present invention, 
this test is used in one embodiment to only compare a template against the visible 
portion of a 3D scene. Other conventional per-pixel acceptance tests and 
operations include scissor tests, stencil tests, blending, dithering, logical operations, 
etc., as described in "The OpenGL® Graphics System: A Specification (Version 
1.2.1)" Copyright® 1992-1999 Silicon Graphics, Inc., April 1, 1999, the subject 
matter of which is incorporated herein by this reference. 

In addition, many such graphics processors also include an alpha blending 
device. Such components are well known to those skilled in the art, and will only be 
described in general below. However, in one embodiment conventional graphics 
processors, such as a conventional computer graphics card or the like is modified to 
include a statistics comparison device for computing statistical information directly 
from information existing in the rendering pipeline of the graphics processor. 

Thus, in one embodiment, the system and method described herein 
compares and matches a first set of digital data to a second set of digital data. 
Further, during a raster transformation of the first and second sets of digital data, 
multiple images of the digital data are placed in texture memory as multiple 
textures. Then, statistics are gathered concerning the textures, and the raster 
transformed sets of digital data are compared and matched against corresponding 
portions of each other. In this context, the system is described as being 
implemented in a computer system including a host processor and a modified 
graphics processor including a texture memory for textures, and a graphics 
processing chip having an address generator, an acceptance tester (for performing 
conventional per-pixel testing, as described above) and a statistics comparison 
device. 

The statistical comparisons and matching processing are provided as part of 
the rasterization pipeline of the graphics processor, and the data extracted from the 
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processes are then recorded for normalized correlations or variations, and for 
subsequent forwarding to the host processor or alternate processing system for 
further processing or storage, as desired. Further, as the data is passed through 
the rasterization pipeline of the graphics processor, statistics between the textures 
are gathered and processed via the statistical comparison processor. 

10. Please replace the paragraph on page 9, lines 2-15 with the following amended 
paragraph: 

FIG. 2 is an overview flow diagram of the system and method of the present 
invention. In general, the present invention matches a first set of digital data to a 
second set of digital data by statistically comparing the sets of data. Namely, first the 
system receives a first set of digital data , such as a template, as a first inpu t, wh i ch 
can bo a tomplato , and/or receives a second set of digital data , such as an image. 
as a second input , wh i ch can bo an i mago (step 210). Next, the system 200 raster 
transforms either the first or the second set of digital data and computes statistics 
on the transformation (step 212). The system then analyzes the resulting statistics 
and makes calculated determinations based on the resulting statistics for generating 
new and different transformations on the data (step 214) until a desired result is 
achieved, such as a match or non-match between the first and the second set of 
digital data (step 216) or between other sets of data. 

1 1 . Please insert the following newly added paragraph on page 9, before the paragraph 
beginning on page 9, line 18: 

It has been observed that textured triangle rasterization performed in a 
conventional graphics processor or the like closely resembles sparse matching of a 
template with an image. In support of this observation, the following discussion will 
include a brief overview of conventional rendering techniques as known to those 
skilled in the art. As described herein, these rendering techniques have been adapted 
for the purpose of template matching. For example, as is well known to those skilled 
in the art, triangle rasterization performed using conventional graphics processors 
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involves fetching a set of pixels arranged in a regular order in one or more subsets 
of graphics memory (the source texture maps), combining or operating on these 
values, and then drawing these into a frame buffer. (See for example, "The 
OpenGL® Graphics System: A Specification (Version 1.2.1)" Copyright® 1992-1999 
Silicon Graphics, Inc., April 1, 1999). Similarly, sparse template matching involves 
fetching two regular subsets of graphics memory and then comparing the values to 
accumulate some statistics. Therefore, the only difference between traditional 
graphics rasterization (rendering) and sparse template matching is the statistical 
comparison of pixels and the accumulation of these statistics, as described below. 
Therefore, an understanding of conventional triangle rasterization will enable those 
skilled in the art to fully understand sparse matching of a template with an image as 
described herein. 

12. Please replace the paragraph on page 9, line 18 through page 10, line 2 with the 
following amended paragraph: 

FIG. 3 is a flow diagram of the operation of the present invention and FIG. 4 is 
a general block diagram of the present invention. Referring to FIG. 3 along with FIGS. 
4 and 2, first, new sets of data, such as an image and/or a template is acquired (step 
310) by the system 400 and initialized by the host processor 408. ¥ke =ln one 
embodiment the host processor 408 can otoro then stores the new sets of data in the 
memory devices. For instance, the first set of data, such as the template can bo js 
loaded into a first memory device 412 and the second set of data, such as the image 
can be loaded into a second memory device 410. Second, models, such as two- 
dimensional (2D) or three-dimensional (3D) models, are rendered and statistics are 
accumulated (step 31 2) by the host processor 408 for the template and the image- 
Rendering and statistical accumulation can bo is accomplished using a 
statistics/comparison device 418 in combination with an address generator 414 and an 
acceptance tester 416, wh i ch wi ll bo as discussed in detail below. 
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13. Please replace the paragraph on page 10, lines 3-18 with the following amended 
paragraph: 

Rendering using conventional graphics processors or the like typically 
involves drawing geometric shapes or primitives, such as polygons, into dedicated 
memory. It should be noted that the present invention preferably uses triangles as 
the drawing primitive, although there are other primitive types that could be used. In 
general, in a simplified example of rasterization provided for purposes of 
explanation, a single triangle can bo is rendered by taking three vertices v 0 , v<i, V2 
with the following fields sx L sy (the screen space coordinates of the triangle in the 
first memory device) tu L tv L rhw L (the 2D coordinates of each vertex in the texture, 
and a perspective term). fte= ln particular, the address generator 414 of a 
conventional graphics processor interpolates these parameters (vh. v±* v?, etc.) 
across the triangle; for each pixel in the first memory device subtended by the 
triangle in screen space (sx sy), pixel values in t he second memory device 4S=are 
used by the address generator 414 to compute an interpolated texture value at the 
corresponding interpolated texture location. Tho comparison otat i otico botwoon tho 
two co l oro aro thon gathered, depend i ng on tho results of tho accoptanco toot. Th i o 
operat i on can bo conoidorod as a resampling operation boing app li ed to tho toxturo, 
which i nvolves f il tering tho tocturo at d i fferent locotiono and at different donoit i oo. 

14. Please insert the following two newly added paragraphs on page 10, after the 
paragraph ending on page 10, line 18: 

Note that the operations described in the preceding paragraph are equivalent 
to a conventional resampling operation being applied to the texture, which involves 
filtering t he texture a t d ifferent I ocations a nd a t d ifferent d ensities. F or example, 
such resampling operations are d escribed in the context of conventional "texture 
minification" and "texture magnification" operations in Section 3.8.5 and 3.8.6 of the 
aforementioned "The OpenGL® Graphics System: A Specification (Version 1.2.1)" 
Copyright® 1992-1999 Silicon Graphics, Inc., April 1, 1999. 
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The present invention builds on these resampling operations by gathering 
comparison statistics between the RGB color values of the corresponding pixels 
between the two memory devices 410 and 412 using the statistics/comparison 
device 418, depending on the results of the acceptance test performed by the 
acceptance tester 41 6. For example, as noted above, if the alpha or z-buffer values 
for particular pixels indicate that those pixels are not visible (e.g., a 0 alpha value), 
then statistics will not be gathered by the statistics/comparison device 418 for those 
pixels. 

15. Please replace the paragraph on page 11, lines 1-10 with the following amended 
paragraph: 

In general, the address generator 414 generates addresses for the template 
and the image that are to be compared. Thooo The addresses €sa=reflect either the 
template or image, or a transformation, such as combinations of rotations, scales and 
perspective transforms, of the template or image. The addresses serve as input to 
filtering functions that read from the images to be compared and generate color 
values (RGBA) and, if present, Z buffer and other per-pixel values. Thooo When 
present, these values can bo are used by the acceptance tester 416 to decide 
whether to allow the pixel to contribute to the statistics. If the pixel is permitted to 
contribute, the color values are sent to a statistics/comparison device 418 for 
statistical analyses and comparison processing. For example, as noted above, if 
the alpha or z-buffer values of a pixel indicate that the pixel would not be visible, 
then the pixel will not be allowed to contribute to the statistical comparison. 

16. Please replace the paragraph on page 1 1 , lines 11-21 with the following amended 
paragraph: 

The statistics/comparison device 418 can conta i n contains variables that are 
updated for each pixel based on the input color values. For instance, in one 
embodiment of the present invention, statistical analyses for comparing and 
matching the sets of digital data can bo is accomplished by initially defining a 
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function or metric within the statistics/comparison device 418 that estimates the 
similarity between the sets of digital data. In this case, one set of digital data is the 
template and the other set of digital data is the image. Tke -Further. the t emplate 
can represent an object of the image that is being tracked and detected (located) 
within the image. The object can bo is then located in the image by computing the 
metric at various locations (u, v) in the image and determining where the metric is 
maximized. 

17. Please replace the paragraph on page 11, line 22, through page 12, line 4 with the 
following amended paragraph: 

In the following examples, T represents the template image and I represents 
the input image. Ono motr i c that can bo uood io In one embodiment, the 
statistics/comparison device 418 uses a cross-correlation coefficient metric for 
measuring which moaouroo the similarity between the image and the template on 
an absolute scale in the range [-1, 1], namely: 

covariancejl, T) = L< Z, ( T ( x > ^ ~ Vt X*(* ~ W ~ v ) " M/ ) 

where fii and a/ designate the mean and standard deviation of the image and /ir and 
oj designate the mean and standard deviation of the template. 

18. Please replace the paragraph on page 12, lines 5-16 with the following amended 
paragraph: 

Exa m ple Additional examples of variables and computations that can bo are 
tracked by the statistics/comparison device 418 in alternate embodiments are 
illustrated below. ¥^ For Example, to compute the statistic, one or more of t he 
following sums can bo are calculated between the template (T) and corresponding 
pixels in the image (I) in various embodiments : 
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1) El and ZT, the sums of the respective pixel values 

2) EIT, the sum of the product of the pixel values 

3) El 2 and ST 2 the sums of the squares of the respective pixel values 

4) Pixel Count, the number of pixels that have been accumulated 

It should be noted that computing these sums may dominate the runtime of the 
pattern recognition or other high-level algorithm that is using the metric. 

19. Please replace the paragraph on page 12, lines 17-25 with the following amended 
paragraph: 

Also, one embodiment uses a summing metric for template matching can bo 
uood that involves summing some function of the difference between the image and 
template pixels, for example: 



where 8 is some value less than 20 (for 8-bit unsigned integer pixel data). A more 
flexible variation involves computing ILUT(f(l,T)), the sum of a lookup based on a 
function of the pixel values. Two examples of f(l,T) are f(l,T)=l-T or f(l,T)=|l-T| (the 
difference and absolute difference of the pixel values, respectively). 

20. Please replace the paragraph on page 13, lines 1-10 with the following amended 
paragraph: 

As mentioned above, in one embodiment a transform con bo is applied to either 
the input image or the template, in order to find transformed versions of the template 
object. Typical transformations can bo include combinations of rotations, scales and 
perspective transforms that are relatively close to identity (to minimize the size of the 
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search space). All of the above-described variations share the characteristic that 
pixels from the template are iterated over pixels in the image, and a calculation is 
performed between corresponding pixels. The template is typically small compared to 
the image and static over a large number of template matching search probes. 

21. Please replace the paragraph on page 13, lines 11-17 with the following amended 
paragraph: 

Sovora l In further embodiments, several higher-level search strategies em 
fee are used to find the best transformation parameters for a given template. Qm 
For example, one such search strategy involves examining all possible 
combinations of parameters, e.g., oxamino examining the template at all possible 
positions, orientations, and scales. Seme =Further. some computational savings can 
be obtained by working in a multi-resolution hierarchy, i.e., to match a reduced size 
template to a reduced size image, and to then try local variations (perturbations) to 
find a better match. 

22. Please replace the paragraph on page 13, line 18, through page 14, line 2, with the 
following amended paragraph: 

Another A search strategy that can bo used in another embodiment involves 
taking derivatives of the matching cost with respect to the unknown parameters, and 
then using a generalized gradient descent technique. In this case, in addition to 
summing intensities, (threshold) intensity differences, and intensity products or 
squares (as in regular, normalized, or robust enhanced cross-correlation), also 
products of horizontal and vertical derivatives are accumulated with themselves and 
with the per-pixel difference. If transformations other than translation are being 
estimated, the number of required derivatives and products rise quickly. However, it 
is possible to amortize the computation of derivatives that are more complicated 
and their products by dividing the template up into smaller regions or patches, and 
only accumulating simpler derivatives and products on a per-pixel basis. 
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23. Please replace the paragraph on page 14, lines 3-12 with the following amended 
paragraph: 

An additional search strategy that can bo used in vet another embodiment is 
to let the unknown transformation parameters (or at least their updates) be 
controlled by the motion of vertices embedded in the template. For example, the 
template can be a wireframe mesh of an object that is being tracked. The control 
mesh can bo is then discretized into finer polygons, such as triangles, and the 
triangles surrounding each control vertex can bo are used to estimate that vertex's 
motion. For reasonably textured templates, convergence will occur to the same 
estimate as the full gradient descent techniques, which explicitly model the 
interaction between vertices controlling a common patch or triangle. 

24. Please replace the paragraph on page 14, lines 13-19 with the following amended 
paragraph: 

In addition, the alpha values , a, of the RGBA (red, green, blue, alpha) values 
in the input colors can bo are used in one embodiment to weight the statistics,. 
where a is a number between 0 and 1 . For example, if a T is the template alpha 
and oti the image alpha, then a new a can bo is derived from these values by either 
selecting one, or by €leia€ hperforminQ a conventional weighted blend between them. 
¥ke=ln one embodiment, the resulting a eeate is then fee=used to weight the pixel's 
contribution to the above-described statistics. Among other things, this would allow 
allows pixels to be importance-weighted by the application. 

25. Please replace the paragraph on page 14, line 22, through page 15, line 7 with the 
following amended paragraph: 

The following working example is for illustrative purposes only. FIG. 5 is a 
block diagram illustrating one embodiment of the present invention. In general, 
similar to FIG.4, the example system 500 of FIG. 5 includes a host processor 508, a 
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first memory device 512, such as a frame buffer, a second memory device 510, 
such as a texture memory device, and a graphics processor 513 such as a modified 
computer video or graphics card that includes an address generator 514, an 
acceptance tester 516 and a statistics/comparison processor 518. Tke =ln this 
example, the frame buffer 512 can bo is a block of graphics memory that represents 
a display for a computer screen and texture memory 510 can bo js a block of 
graphics memory that ca n bo may contain portions of the display screen. In 
addition, the example system 500 of FIG. 5 also includes a statistics enable switch 
520 and an alpha blending device 522. In this example, the graphics processor 513 
resamples either the first or the second set of data to be matched to each other using 
a conventional perspective transformation. Note that such perspective transforms are 
well known to those skilled in the art, and are described in Section 2.10 of "The 
QpenGL® Graphics System: A Specification (Version 1.2.1) " Copyright @ 1992-1999 
Silicon Graphics, Inc.. April 1, 1999. 

26. Please replace the paragraph on page 15, lines 8-20 with the following amended 
paragraph: 

P r oforab l y In one embodiment, this example system can bo uood ao provides 
a three-dimensional (3D) graphics rasterizer that can bo i ntogratod is modified as 
described above to form a novel video graphics device or hardware video card for 
computer systems. In this embodiment, the first set of digital data can bo js stored 
in the texture memory while the second set of data can bo is stored in the frame 
buffer. Also, statistical generation can bo is performed by the statistics/comparison 
device 518 with or without actually rendering or writing a 3D digital scene to the 
frame buffer with by routing the data to the statistics/comparison device via the 
statistics enable switch 520. For imp l omontation of tho Therefore, for implementing 
this system using a computer video graphics hardware device, the additional core 
logic represented by the statistics/comparison device 518 and the statistics enable 
switch 520 is can bo used to compute the statistics and to forward the results back to 
the host processor 508 upon request. FIG. 5 shows computations of the statistics 
between the texture map and the frame buffer image for tracking statistics on the two 
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input pixel values instead of blending between them (via the alpha blending device 
522) and writing the result to the frame buffe r 512 . 

27. Please replace the paragraph on page 1 5, line 21 , through page 1 6, line 2 with the 
following amended paragraph: 

Namely, when the statistics enable switch 520 is enabled, the graphics 
processor 513 renders the rasterized information (step 220 of FIG. 2) without writing 
the results to the frame buffe r 512 . In contrast, when the statistics enable switch 
520 is disabled, the graphics processor 513 actually renders or writes the rasterized 
information to the frame buffer and display screen. Conventionally, the alpha 
blending device 522 allows use of an additional (such as a fourth) color component 
that is not displayed, but that corresponds to the opacity of a surface. This provides 
control of the amount of color of a pixel in the source surface to be blended with a 
pixel in the destination surface. However, as noted above, in the context of the 
present invention, alpha values associated with pixels are instead used for 
weighting computed statistics. Consequently, when the statistics enable switch 520 
is enabled, the statistics/comparison device 518 uses conventional weighting 
methods for weighting the statistics relative to the alpha values associated with the 
pixels. 

28. Please replace the paragraph on page 1 6, lines 3-12 with the following amended 
paragraph: 

As noted above, it has been observed that textured triangle rasterization 
performed in a conventional graphics processor or the like closely resembles sparse 
matching of a template with an image. te =Conseguentlv. in one specific embodiment 
of the example of FIG. 5, the template is treated as a texture and the frame buffer an 
image and the display primitive for rendering purposes is a triangular polygon. In 
addition, instead of rasterizing the texture into the frame buffer, certain statistics =eaa 
fe© are recorded for normalized correlation or other statistics can be recorded for 
var i ationo various embodiments . In one example, if the texture is considered a 
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template and the frame buffer an image, the graphics processor 513 ©a^tee is used 
to resample the template using a perspective transformation. Also, the 
statistics/comparison device 518 of the modified graphics processor 513 can bo is 
used to record statistics (ST, SI, SIT, ST 2 , SI 2 for normalized correlation, or other 
statistics for a variation) for later forwarding to the host processor. 

29. Please replace the paragraph on page 16, lines 13-25 with the following amended 
paragraph: 

The example system 500 is extremely flexible because it has the ability to 
intersperse rendering/accumulation commands with accumulator readback 
commands. For example, if a deformable triangular patch is being tracked, the 
triangle can be rendered using a number of smaller triangles (such as 16), and the 
accumulator can read back after each of the small triangles has been rendered. 
This allows host processor 508 to compute necessary derivatives and sums to 
compute a full affine motion for the triangle. Similarly, if a fu l l oot of data larger 
template, potentially consisting of a number of triangles , is being tracked, each 
triangle's accumulated values can bo are read back in order to compute an 
independent motion of each control verte x for the template . The It has been 
observed that the number of readbacks per rendered model are few enough that 
they should not impose a large burden on a hardware graphics port of a computer 
system, such as the exemplary computer system depicted in FIG. 1. 

30. Please replace the paragraph on page 16, line 26, through page 17, line 12, with 
the following amended paragraph: 

Btko E Note that either the first or the second set of data to be compared with 
each other can be rendered at a number of offsets. This allows the host processor 
508 to either explicitly find the best position for the first set of data, such as the 
template, or accumulate the required information to analytically compute the best 
update. The offsets are preferably simple integer or fractional (such as Vz pixel) 
perturbations to the vertices. As such, it is preferable that the system 500 supports 
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differencing of the deformed data (such as the template) and the target data (such 
as the image) at a number of pixel or sub-pixel shifts. For example, in one 
embodiment, the host processor 508 can opocify specifies the shift amount (for 
instance c/=1 or d= 1 /4 pixel), to enable accumulation differences not only with the 
target data, but also with versions shifted by ±d pixels horizontally and vertically 
(accumulating 9 times the statistics). For software implementations, the 
speed/memory-hardware tradeoff is good, where the cost of rasterizing a single 
pixel is still several cycles. 

31 . Please replace the paragraph on page 17, lines 13-18 with the following amended 
paragraph: 

Another advantage of integrating the rasterization and matching stages (via 
the statistics/comparison device 518) is that the graphics hardware can perform is 
then capable of performing the visibility computation for 3D model-based tracking. 
The 3D model would be rendered once in order to compute the z-buffer, and then it 
would be rendered again to compute the (per-triangle) statistics. Rondorod Note 
that rendered pixels which fail the z-buffer test (i.e., the depth buffer test) w ould be 
discarded from the computation, since they are not visible. 

32. Please replace the paragraph on page 3, line 3-1 7 with the following amended 
paragraph: 

Display primitives, in this case triangles, are used to transform the template 
615 and locate it in the input image 630. For instance, the mapping of the triangles 
from template 615 to image 630 is shown by arrows 640, 645. Also, although two 
display p rimitives encompass the entire template 615 for rasterizing the template 
615, additional primitives can be used for rasterizing and they do not necessarily 
have to encompass the entire template 615. Further, any suitable transform can be 
used. For example, powerful transforms, such as perspective transforms, safe can 
be applied to the template, as well as the affine transform depicted in FIG. 6C. 
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